Dynamic Document Delivery: Generating Natural Language Texts on Demand
Abstract
Research in natural language generation promises significant advances in how the contents of underlying information sources can be made available. Most work in the field relies on carefully constructed artificial intelligence knowledge bases; in reality, however, most information currently stored on computers is not represented in this format. In this paper, we describe work in progress in which we attempt to generate large numbers of texts automatically from existing databases. We focus in particular on the automatic generation of descriptions of objects stored in a museum database, highlighting the difficulties that arise in using a real data source and pointing to some possible solutions.
Similar resources
Plagiarism checker for Persian (PCP) texts using hash-based tree representative fingerprinting
With due respect to authors’ rights, plagiarism detection is one of the critical problems in the field of text mining, and one that interests many researchers. This issue is considered a serious one in high academic institutions. There are language-independent tools, but they do not yield reliable results because they ignore the special features of each language. Considering the paucit...
Adding Syntax to Dynamic Programming for Aligning Comparable Texts for the Generation of Paraphrases
Multiple sequence alignment techniques have recently gained popularity in the Natural Language community, especially for tasks such as machine translation, text generation, and paraphrase identification. Prior work falls into two categories, depending on the type of input used: (a) parallel corpora (e.g., multiple translations of the same text) or (b) comparable texts (non-parallel but on the s...
Evaluating integrated NLP in foreign language learning: technology meets pedagogy
In this paper I would like to present the pedagogical perspective on a 3-year research project with direct implications for the field of Foreign Language Teaching (FLT), and English for Professional Purposes (ESP) in particular. As a project in Computer Science, the basic aim of LARFLAST was to investigate the integration of several innovative software components into a harmonized environme...
Discourse Factors in Multi-Document Summarization
The overabundance of information today, especially online, has established the need for natural language technologies that can help the user find relevant information; multi-document summarization (MDS) and question answering (QA) are two examples. The requirement in MDS and open-ended QA to produce multi-sentential answers imposes the extra demand that the output of such systems be a coherent d...
Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content
Parallel corpora are indispensable resources for a variety of multilingual natural language processing tasks. This paper presents a technique for fully automatic construction of constantly growing parallel corpora. We propose a simple and effective dictionary-based algorithm to extract parallel document pairs from a large collection of articles retrieved from the Internet, potentially containin...
Publication date: 1998